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Exploiting Reduction Rules and Data Structures: 

Local Search for Minimum Vertex Cover in Massive Graphs 


Abstract 

The Minimum Vertex Cover (MinVC) problem is a 
well-known NP-hard problem. Recently there has been 
great interest in solving this problem on real-world mas¬ 
sive graphs. For such graphs, local search is a promis¬ 
ing approach to finding optimal or near-optimal solu¬ 
tions. In this paper we propose a local search algorithm 
that exploits reduction rules and data structures to solve 
the MinVC problem in such graphs. Experimental re¬ 
sults on a wide range of real-word massive graphs show 
that our algorithm finds better covers than state-of-the- 
art local search algorithms for MinVC. Also we present 
interesting results about the complexities of some well- 
known heuristics. 


Introduction 

The Minimum Vertex Cover (MinVC) problem is a well- 
known NP-h ard problem (|Karp \912\ with many real-world 
applications ( Johnson and Trick 1996) . Given a simple undi¬ 
rected graph G = (V, E) where V is the vertex set and E is 
the edge set. An edge e is a set {u, u} s.t. u,v G V, and we 
say that u and v are endpoints of e. A vertex cover of a graph 
G = (y, E) is a subset V' GV s.t. for each e G E, at least 
one of e’s endpoints is in V'. The size of a vertex cover is 
the number of vertices in it. The MinVC problem is to find 
a vertex cover of minimum size. 

With growing interest in social networks, scientific com¬ 
putation networks and wireless sensor networks, etc., the 
MinVC problem has re-emerged even with greater signifi¬ 
cance and complexity, so solving this problem in massive 
graphs has become an active research agenda. In this paper 
we are concerned in finding a vertex cover whose size is as 
small as possible. 

It is hard to approximate MinVC within any factor smaller 
than 1.3606 ( |Dinur and Safra 2004| l. During last decades 
there were many works in local search for MinVC like 
( |Richter, Helme rt, and Gretton 2007~| |Cai et al. 2013| l. Re- 

cently FastVC ( |Cai 2015 1 makes a breakthrough in massive 
graphs. It makes a balance between the time efficiency and 
the guidance effectiveness of heuristics. However, we realize 
that FastVC exploits very little about the structural informa- 
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tion. Also in order to achieve satisfactory time efficiency, it 
sacrifices the guidance effectiveness. 

The aim of this work is to develop a local search MinVC 
solver to deal with massive graphs with strong structures. 
The basic framework is this. Firstly, we exploit reduction 
rules to construct good starting vertex covers. Then we use 
local search to find better covers. In both the construction 
stage and the local search stage, we exploit a novel data 
structure called alternative partitions to pursue time effi¬ 
ciency, without sacrificing the quality of heuristics. Since 
we are now focusing on the impacts of the reduction rules 
and the data structures, we use naive local search strategies, 
so our solver may be too greedy. For future works, we will 
exploit some strategies to diversify our local search. 

Our solver constructs starting vertex covers by incorpo¬ 
rating reduction rules. In our experiments, our construction 
heuristic performs close to or even better than FastVC, on 
a large portion of the graphs. Moreover, it outputs a cover 
typically within 10 seconds. Hence, it provides a good start¬ 
ing point for later search. Furthermore, for a small portion 
of the graphs, our heuristic guarantees that it has found an 
optimal cover, due to the power of reduction rules. So far as 
we know, this is the first time reduction rules are applied in a 
local search MinVC solver, although they have been widely 
discussed in the community of theoretical computer science. 

We also propose a brand new data structure to achieve 
time efficiency. The main idea is to partition the vertices wrt. 
their scores, i.e., two vertices are in the same partition if they 
have the same score, otherwise they are in different parti¬ 
tions. Thanks to this data structure, (1) as to the construction 
stage, the complexity of two important construction heuris¬ 
tics has been lowered, from 0(|y p) to 0(|y | + |£’|); (2) as 
to the local search stage, the complexity of the best-picking 
heuristic has also been lowered, from 0{\G\) to 0{davg) 
where G is the set of vertices to be selected in local search, 
and davg is the average degree. Later in this paper we will 
prove these results rigorously. We applied these theoretical 
results in our solver, so we call our solver LinCom (Linear- 
Complexity-Heuristic Solver). 

We tested LinCom and FastVC on the standard bench¬ 
mark of massive graphs from the Network Data Repositorjj^ 
( Rossi and Ahmed 20151 1. Our experiments show that among 

*http://www.graphrepository.com./networks.php 


















all the 12 classes of instances in this benchmark, LinCom 
falls behind Fast VC in only one class. Moreover LinCom 
finds smaller covers for a considerable portion of the graphs. 
This improvement is big, since it rarely happens in the liter¬ 
ature ( |Cai 2015| ). 

Preliminaries 

Basic Notations 

If e = {u, u} is an edge of G, we say that u and v are neigh¬ 
bors. We define N{v) as {u G V\{u,v} G E}. The degree 
of a vertex v, denoted by d{v), is defined as |N(?;)|. We use 
davg{G) and d„„x{G) to denote the average degree and the 
maximum degree of graph G respectively, suppressing G if 
understood from the context. An edge e = {u, u} is covered 
by a vertex set S if one of its endpoints is in S, i.e., u G S 
or V G S (or both). Otherwise it is uncovered by S. 

Local Search for MinVC 

Most local search algorithms solve the MinVC problem by 
iteratively solving its decision version-given a positive inte¬ 
ger k, searching for a fc-sized vertex cover. A general frame¬ 
work is Algorithm [T] We denote the current candidate solu¬ 
tion as C, which is a set of vertices selected for covering. 


Both loss and gain are scoring properties of vertices. In any 
step, a vertex v has two possible states: inside G and outside 
G, and we use age(v) to denote the number of steps that 
have been performed since last time w’s state was changed. 


The Construction Stage 

Previous InitVC procedures constract a starting vertex 
cover from an empty set C mainly as below: 


Max-gain: select a vertex v with the maximum gain and 
add V into G, breaking ties randomly. Repeat this proce¬ 


dure until G becomes a cover. (Papadimitrious and Stei- 
glitz 1982| l 


2. Min-gain: Select a vertex v with the minimum posi¬ 
tive gain and add all u’s neighbors into G, breaking 
ties randomly. Repeat this procedure until G becomes a 
cover. Redundant vertices (vertices whose loss is 0) in G 
are then removed. (|Ugurlu 2012| |Kettani, Ramdani, and| 
ITadili 201^ 


3. Edge-greedy: Select an uncovered edge e, add the end¬ 
point with higher degree into G. Repeat this procedure 
until G becomes a cover. Redundant vertices in G are then 
removed by a read-one procedure. (Cai 20151 


Algorithm 1: A Local Search Framework for MinVC 

1 {G,optInfo) ^ InitVGO; 

2 while not reach terminate condition do 

3 if C covers all edges then 

4 G* ^G\ 

5 remove a vertex from C; 

6 exchange a pair of vertices; 

7 return C* 


Algorithm[T]consists of two stages: the construction stage 
(Line[2l and the local search stage (Lineto |^. At the be¬ 
ginning, an initial vertex cover is constructed by the InitVG 
procedure. Throughout this paper, this initial cover is called 
the starting vertex cover. Besides InitVG returns another 
parameter, i.e., opt Info which takes the value optimal- 
guaranteed or optimal-not-guaranteed (See Algorithm]^. 

In the local search stage, each time a fc-sized cover is 
found (Line[^, the algorithm removes a vertex from G (Line 
1^ and begins to search for a (fc — l)-sized cover, until some 
termination condition is reached (Line [^. The move to a 
neighboring candidate solution consists of exchanging a pair 
of vertices (Line[^: a vertex u G C is removed from G, and 
a vertex u ^ C is added into G. Such an exchanging pro¬ 
cedure is also called a step by convention. Thus the local 
search moves step by step in the search space to find a better 
vertex cover. When the algorithm terminates, it outputs the 
smallest vertex cover that has been found. 

For a vertex v G C, the loss of v, denoted as loss{v), is 
defined as the number of covered edges that will become un¬ 
covered by removing v from G. For a vertex v ^ C, the gain 
of V, denoted as gain{v), is defined as the number of un¬ 
covered edges that will become covered by adding v into G. 


Reduction Rules for MinVC 

Our solver will incorporate the following reduction rules in 
the /nitVC procedure to handle vertices of small degrees. 
Degree-1 Rule: If G contains a vertex u s.t. N{u) = {u}, 
then there is a minimum vertex cover of G that contains v. 

The two rules below are from ( |Chen, Kanj, and Jia 200 l| l. 
Degree-2 with Triangle Rule: If G contains a vertex v s.t. 
N{v) = {ni,n 2 } and {ni,n 2 } G E, then there is a mini¬ 
mum vertex cover of G that contains both ni and n 2 . 
Degree-2 with Quadrilateral Rule: If G contains two ver¬ 
tices u and V s.t. N{u) = N{v) = {ni, 77 , 2 } and {ni, 77 , 2 } ^ 
E, then there is a minimum vertex cover of G that contains 
both 771 and 772. 

Since we are to develop a local search solver, we now 
rewrite them in the terminologies of local search. 

Degree-1 Rule: If gain{y) = 1 and u is a neighbor of v s.t. 
u ^ C, then put u into the G. 

Degree-2 with Triangle Rule: If gain{v) = 2, and 771,772 
are both u’s neighbors s.t. ni,n 2 ^ G and { 171 , 772 } G E, 
then put both 771 and 772 into the G. 

Degree-2 with Quadrilateral Rule: If gain{u) = gain{v) = 
2, and both 77i, 772 are neighbors shared by u, v s.t. rii, 772 ^ 
G and { 771 , 772 } ^ E, then put both 771 and 772 into the G. 

Incorporating Reduction Rules 

We incorporate reduction rales in order to: (1) construct 
smaller starting vertex covers; (2) help confirm optimality. 

Constructing A Vertex Cover with Reductions 

Like ( |Cai 2015] l, our InitVG procedure also consists of an 
extending phase (Lines to and a shrinking phase (Line 
[9]|. Notice that if we construct a cover by only using reduc¬ 
tion rules, then it must be optimal. So we employ a predicate 






















A Local Search MinVC Solver 


Algorithm 2: InitVC 

input ; A graph G = {V, E) 

output: A cover C and whether-optimal-guaranteed 

1 0 ; 

2 max-gain_used false', 

3 while there exist uncovered edges do 

4 Repeatedly apply the Degree-2 with Triangle Rule 
until it is not applicable; 

5 Repeatedly apply the Degree-2 with Quadrilateral 
Rule until it is not applicable; 

6 Repeatedly apply the Degree-1 Rule until it is not 
applicable; 

7 if any rule above is applicable then continue if all 
edges are covered then break max_gain_used ^ 
true', 

8 pick a vertex v with the maximum gain (ties are 
broken randomly), put it into C; 

9 C 3r- eliminateRedundantVertices(C'); 

10 if max^gainjAsed = true then 

11 I return (C, optimal-not-guaranteed)', 

12 else 

13 return {C, optimal-guaranteed)'. 


max-gainjused s.t. max_gain_used = true if Linej^has been 
executed, and max-gainMsed = false otherwise. 

In Line [T] we initialize C to be an empty set. Then we 
extend C to be a vertex cover of G, by iteratively adding 
a vertex into G. Line s [4| to apply reduction rules to put 
vertices into C. Line [T] ensures that no reduction rules are 
applicable before making use of the max-gain heuristic. Af¬ 
ter the extending phase (Lines to [^, Line [9| rem oves the 
redundant vertices from G just as what (|Cai 2015 i did. 


Fixing Vertices in the Starting Vertex Cover 

When Algorithm constructs a starting vertex cover, we 
realize that some of the vertices are put into G based 
on pure reductions. That is, they were put into G when 
max^gainMsed = false. Hence, there exist a minimum vertex 
cover which contains all of such vertices, and we call them 
inferred vertices. In local search we can fix the inferred ver¬ 
tices in C s.t. they are never allowed to be removed from G. 
It seems that such a procedure are able to reduce the search 
space and speed up the search. 

So we employ an array fixed, whose element is an indi¬ 
cator for a vertex. During the execution of Algorithm]^ we 
maintain the fixed array as below: 

1. Rule 1: Before the extending phase, for each vertex v, 
fixed[v] is set to false. 

2. Rule 2: When putting a vertex into G, we check whether 
max-gainMsed = false. If so, fixed[v] is set to true. 

Thus when Algorithm|^is completed, fixed[v] = true if v is 
an inferred vertex, and fixed[v] = false otherwise. So later 
when we are doing local search, we can forbid u from being 
removed from G if fixed[u] = true, as is shown in Linem 
and in Algorithm 


Algorithm 3: LinCom(G', cutoff) 

input : A graph G = {V,E), the cutoff time 
output: A vertex cover of G 

1 {C,optInfo) 3^ InitVC{); 

2 if opt Info = optimal-guaranteed then return G while 
elapsed time < cutoff do 

3 if (7 covers all edges then 

4 G* ^C', 

5 remove a vertex u s.t. fixedlv] = false with 
minimum loss from G, breaking ties randomly; 

6 remove a vertex u G G s.t. fixed[u] = false with 
the minimum loss, breaking ties randomly; 

7 e a random uncovered edge; 

8 add the endpoint of e with the greater gain, 
breaking ties in favor of the older one; 

9 return C*; 


Our solver LinCom is outlined in Algorithm At hrst a 
vertex cover is constructed. If the returned cover is guaran¬ 
teed to be optimal, the algorithm will immediately return. 

Then at each step, the algorithm first chooses a vertex 
u G G s.t. u is not an inferred vertex (i.e., fixed[u\ = false) 
with the minimum loss, breaking ties randomly. Then the 
algorithm picks a random uncovered edge e, chooses one of 
e’s endpoints with the greater gain and adds it, breaking ties 
in favor of the older one. 

Data Structures 

In order to lower the complexities, we exploited an efficient 
data structure named alternative partitions (See Figure[T]). 

Alternative Partitions 

We use loss-k (resp. gain-k) partition to denote the par¬ 
tition that contains vertices in G (resp. outside C) whose 
loss (resp. gain) is k (Figure 0. All the loss-k partitions 
are shown as dark regions, and all the gain-k partitions are 
shown as light ones. Since the dark and the light regions are 
distributed alternatively, we call them alternative partitions. 
Obviously we have 

Proposition 1 7. 0 < gain{v) < d{v) < \V\ where v ^ G. 
2. 0 < lossiy) < d{v) < \V\ where v G G. 

Then we use Algorithm|^to hnd those vertices in G with the 
minimum loss. 


Algorithm 4: randomMinLossVertex 
input : A sequence of alternative partitions 
output: A random vertex v G G with minimum loss 

1 fc •<— 0; 

2 while the loss-k partition is empty do k ^ k -\-1 
return a random vertex in the loss-k partition; 

















In this algorithm we first check whether there are any ver¬ 
tices whose loss is 0. If so, we randomly return one of them. 
Otherwise, we go on to check whether there are any vertices 
whose loss is 1, 2, ... until we find a non-empty partition. 
Then we randomly return one in that partition. So we have. 

Proposition 2 The complexity of Algorithm^is 
Similarly we have 

Proposition 3 The complexity of finding the partition with 
the maximum/minimum gain is 0{dm^). 

Implementations 

Given a graph G = (V, E) and a candidate solution C, we 
implement the alternative partitions on an array where each 
position holds a vertex (See Figure[2l. Besides, we maintain 
two additional arrays of pointers, each of which points to the 
beginning of a specihc partition. Imagine the array as a book 
of vertices and the pointer arrays as the indexes of the book. 

Initializing the Partitions At hrst when C is empty, there 
are no dark regions in our data structure, so initializing the 
partitions is equivalent to sorting the vertices into a mono¬ 
tonic nondecreasing order, based on their gain. Notice that 
at this time, the gain of any vertex is equal to its degree, so 
we now need to sort vertices by degrees. By Proposition 
this satishes the assumption of counting sort which runs in 
linear time ( |Cormen et al. 200^ . Thus we have. 

Proposition 4 Initializating the partitions is 0(|1^|). 

Maintaining the Partitions After initializations, there are 
two cases in which a particular vertex, say v, has to be 
moved from one partition to another: (1) adding (resp. re¬ 
moving) V into (resp. from) C; (2) increasing/decreasing 
gain{v)lloss{v) by 1. Thus the core operation is to move 
a vertex v to an adjacent partition. 



Figure 1: Adding vqs into C (a) 
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Figure 2: Adding ves into C (b) 



Figure 3: Adding vqs into C (c) 


Now we show how to do this with an example (See Figure 
[T]to|^. In this example, we are to add vqs into C. Initially 
Ugi and vqs are in the gain-52 partition and thus their gain is 
52 (Figure [T]). Notice that after being added, ugs’s loss will 
become 52, i.e., it should be in the loss-52 partition. Thus 
the operation is performed like this: (1) Ugs is swapped with 
ugi (Figure]^; (2) Pgain -52 is moved (Figure]^. 

We dehne placeVertexIntoC(t;) as the procedure that 
moves V from certain gain-k partition to the respective 
loss-k partition, puts it into C and updates its score. 
And we define gainMinusMinus(z;) as the procedure that 
moves V from certain gain-k partition to the respective 
gain-{k — 1) partition and updates its score. Analogously 
we dehne placeVertexOutfromC(v), lossMinusMinus(u), 
gainPlusPlus(r;), and lossPlusPlus(r;). Then we have 

Proposition 5 All the procedures are ofO{l) complexities. 


Complexity Analysis 

In this section, we evaluate the complexities of the best¬ 
picking and the vertex cover construction heuristics. 


Complexity of The Best-picking Heuristic 

Along with adding/removing a vertex v, we have to move 
this vertex and all its neighbors to other partitions. Thus by 
Proposition 1^ maintaining the partitions will take 0(1) time 
plus an amount of time proportional to d{v). Thus, 


Proposition 6 When a vertex is added/removed, the com¬ 
plexity of maintaining the partitions at each step is 0{d^f). 

By Proposition]^ and 1^ we have 

Proposition 7 The best-picking heuristic in Algorithm 
can be done in O(dmaK) complexity. 

In the local search stage, by Proposition [T] we have 


Theorem 8 Suppose that each vertex has equal probability 
to be added or removed, then the average complexity of the 
best-picking heuristic in Algorithm^is O(davg)- 

It is nice because ( |Cai 2015] l stated that the best-picking 
heuristic was of 0(\C\) complexity. Since most real-world 
graphs are sparse (|Barabasi and Albert 1999HEubank et al.| 


2004 Chung et al. 2006|l, we have da„g « \C\7 


Complexity of The Max-gain/Min-gain Heuristics 

( |Cai 2 OI 5 I 1 formally proved that the max-gain heurist ic had a 


worst-case complexity of 0{\Vf). Moreover, both (Ugurlu 

















































Algorithm 5: minGainConstructVC 
input ; A graph G = {V, E) 

output: A cover C and whether-optimal-guaranteed 

1 C \UE\ \E\-, initialize the partitions; 

2 while \UE\ > 0 do 

3 h i — 1 ; 

4 while the gain-k partition is empty do fc •<— fc + 1 
miri-gju ^ a random vertex in the gain-k partition; 

5 foreach v G N{min-g-v) do 

6 if z) G C then continue placeVertexIntoC(z;); 
\UE\ ^ \UE\ — gain{v); 

7 foreach n G N{v) do 

8 if n G C then lossMinusMinus(n) else 
gainMinusMinus(n) 

9 return (C, optimal-not-guaranteed)'. 


2012[) and (|Kettani, Ramdani, and Tadili 2013 i proved rigor¬ 


ously that the worst-case complexity of the min-gain heuris¬ 
tic was 0{\V\‘^). Yet with the alternative partitions, we have 


When the condition in Line|^is tested, there are two cases: 
(1) if succeeds, then k is decreased by 1; (2) if fails, then 
one vertex is put into C. So the number of tests done in 
Linej^is exactly \C\ -\- d„ax < \C\ -f |y|. Similarly, the 
overall complexity is 0{\V\ -f \C\ -f \E\). 


Algorithm 6; maxGainConstructVC 

input : A graph G = {V, E) 

output: A cover C, whether-optimal-guaranteed 

1 C ^ 0; \UE\ ^r- \E\', initialize the partitions; 

2 k -(r- d„,a/, 

3 while \UE\ > 0 do 

4 while the gain-k partition is empty do A: •<— fc — 1 
V ^a random vertex in the gain-k partition; 

5 placeVertexlntoC(w); \UE\ ^ \UE\ — gain{v); 

6 foreach n G N{v) do 

7 if n G C then lossMinusMinus(n) else 
gainMinusMinus (n) 

8 return {C, optimal-not-guaranteed); 


Theorem 9 The min-gain/max-gain heuristic constructs a 
vertex cover in 0(|1/| -f \C\ -I- |£'|) complexity, where G is 
the starting vertex cover. 


Proof: We use UE to denote the set of uncovered edges. 


1. We prove the case for min-gain by Algorithm 0 By 
Proposition]^ Line 0 has a complexity of 0(|F|). 

In aiw cycle of the outer while-loop, if the condition in 
Linefflis tested for t times, then gain{min-g jv) = t, and 
thus t neighbors of miri-g-V will be put into G. That is, in 
any cycle, the number of tests done in Line 0 is equal to 
the number of vertices that will be put. So that condition 
will be tested for exactly \G\ times during the algorithm. 


Given min-gain ju in Line 0 the algorithm tests each of 
its neighbors whether they are in G in Line 0 Consider¬ 
ing the case that we have to test every neighbor of every 
vertex, the total number of tests done is 2\E\. Thus the 
condition in Line0will be tested for at most 2\E\ times. 

After putting a vertex into G in Line 0 we have to up¬ 
date the information about its neighbors (Line 00. Again 
considering the extreme case above, the total number of 
updates (gainMinusMinus or lossMinusMinus) will be at 
most 2|£^|. By Proposition 0 the time spent in Line 


during the algorithm is 0{ f^l) . To conclude, the overall 
complexity is 0{\V\ \C\ -f |£^|). 


2. We prove the case for max-gain by Algorithm0 

In Line 0 we initialize k to be d,„a_^ which is equal to the 
maximum gain at this time. Notice that the value of the 
maximum gain never increases in the construction stage. 
So during the execution, whenever we hnd that there are 
no vertices whose gain is g, we go on to check whether 
there are any vertices whose gain is p — 1. Thus, during 
the execution, k is always the value of the maximum gain. 


Besides, we compared Algorithm 0 with the traditional 
one ( |Cai et ah 201 3| l through experiments. Moreover, as to 
the min-gain heuristic, we program it ourselves in two ways: 
Algorithm0and the previous way. It shows that our methods 
are faster than the traditional ones by orders of magnitude on 
large instances. So our experimental results were completely 
consistent with the theoretical expectations. So far we have 
not derived the complexity of Algorithm 0 yet, but we be¬ 
lieve that it is also linear, because our InitVG procedure 
outputs a vertex cover typically within 10 seconds. 

Because the max-gain heuristic was proposed about three 
decad es ago ( [Papadimitrious and Steiglitz 1982) 1, and ( |Cai| 
2015|l still proved the 0{\V\'‘^) complexity, our result is sur¬ 
prising. Note that partitioning is a general method and can 
also be applied to solve huge instances for other problems. 


Experimental Evaluation 

In this section, we carry out extensive experiments to evalu¬ 
ate LinCom on massive graphs, compared against the state- 
of-the-art local search MinVC algorithm FastVC. To show 
the individual impacts, we also present the performances of 
our InitVG procedure (named as InitVC in the tables). 


Benchmarks 

We downloaded all 139 instance^ They were originally on¬ 
line]^ and then transformed to DIMACS graph format. But 
we excluded three extremely large ones, since they are out 
of memory for all the algorithms here. Thus we tested all 
the solvers on the remaining 136 instances. Some of them 
have recently been used in testing parallel algorithms for 


Maximum Clique and Coloring problems (Rossi and Ahmed 
[M4l [Rossi et al. 2014| l. 


^http://lcs.ios.ac.cn/caisw/Resource/realworld%20graphs.tar.gz 
^http://www.graphrepository.com./networks.php 































Table 1: Experimental results on collaboration networks, 
facebook networks, interaction networks, infrastructure net- 


works, recommend networks and retweet networks 


Graph 

FastVC 

InitVC 

LinCom 

A 





ca-AstroPh 

11483 (11483) 

11483 (11483.36) 

11483 (11483.01) 

0 

ca-citeseer 

129193(129193) 

129193 (129193.82) 

129193(129193.36) 

0 

ca-coauthors-dblp 

472179 (472179) 

472234 (472242.19) 

472179 (472179.02) 

0 

ca-CondMat 

12480(12480) 

12481 (12481.25) 

12480(12480.06) 

0 

ca-dblp-20I0 

121969 (121969) 

121970(121971.02) 

121969(121969.64) 

0 

ca-dblp-2012 

164949(164949) 

164949 (164950.88) 

164949(164950.35) 

0 

ca-hollywood-2009 

864052 (864052) 

864052 (864053.9) 

864052(864052.01) 

0 

ca-MathSciNet 

139951 (139951) 

139951 (139952.45) 

139951 (139952.23) 

0 

socfb-A-anon 

375231 (375232.94) 

375230 (375230.82) 

375230(375230.82) 

1 

socfb-B-anon 

303048 (303048.93) 

303048 (303048) 

303048 (303048) 

0 

socfb-Berkeleyl3 

17209(17212.18) 

17280(17290.32) 

17210(17215.93) 

-1 

socfb-CMU 

4986 (4986.72) 

5002 (5007.41) 

4986 (4987.24) 

0 

socfb-Dukel4 

7683 (7683.05) 

7707 (7712.34) 

7683 (7684.98) 

0 

socfb-Indiaiia 

23313 (23317.19) 

23426 (23439.12) 

23319(23323.79) 

-6 

socfb-MIT 

4657 (4657) 

4663 (4669.13) 

4657 (4657.56) 

0 

socfb-OR 

36547 (36549.44) 

36586 (36594.26) 

36548 (36549.50) 

-1 

socfb-Penn94 

31161(31164.95) 

31299 (31313.34) 

31165 (31170.78) 

-4 

socfb-Stanford3 

8517 (8517.89) 

8534 (8540.01) 

8518 (8518.35) 

-1 

socfb-Texas84 

28166 (28171.54) 

28306 (28317.76) 

28169 (28178.98) 

-3 

socfb-UCLA 

15222(15224.41) 

15279(15294.25) 

15224(15228.85) 

-2 

socfb-UConn 

13230(13231.60) 

13287 (13300.16) 

13232(13235.99) 

-2 

socfb-UCSB37 

11261 (11262.88) 

11310(11316.65) 

11262(11265.54) 

-1 

socfb-UF 

27305 (27309.04) 

27440 (27453.23) 

27310(27316.25) 

-5 

socfb-UIllinois 

24090 (24093.97) 

24209 (24222.07) 

24095 (24101.18) 

-5 

socfb-Wisconsin87 

18383 (18385.46) 

18468 (18483.70) 

18384(18390.13) 

-1 

ia-enron-large 

12781 (12781) 

12781 (12781.2) 

12781 (12781.2) 

0 

inf-power 

2203 (2203) 

2203 (2203.01) 

2203 (2203.01) 

0 

inf-roadNet-CA 

1001254(1001325.29) 

1007098 (1007362.34) 

1001058 (1001139.61) 

196 

inf-roadNet-PA 

555203 (555248.74) 

558206 (558343.72) 

555035 (555107.22) 

168 

rec-amazon 

47606(47606.01) 

47605 (47611.64) 

47605 (47605.62) 

1 

rt-retweet-crawl 

81044(81047.81) 

81040(81040) 

81040(81040) 

4 


Table 2: Experimental results on scientific computation net¬ 
works, social networks, technological networks, temporal 


reachability networks and web link networks 


Graph 

FastVC 

^mini^avp) 

InitVC 

LinCom 

A 

sc-ldoor 

856754 (856757.36) 

858142 (858173.08) 

856755 (856757.18) 

-1 

sc-msdoor 

381558 (381559.23) 

382102 (382120.66) 

381559 (381559.86) 

-1 

sc-nasasrb 

51242 (51247.27) 

51575 (51605.64) 

51243 (51249.23) 

-1 

sc-pkustkil 

83911 (83912.97) 

84124(84146.02) 

83911 (83913.52) 

0 

sc-pkustkl3 

89217(89220.46) 

89625 (89652.49) 

89219(89222.95) 

-2 

sc-pwtk 

207711 (207720.22) 

208713 (208760.96) 

207698 (207711.11) 

13 

sc-shipsecl 

117305 (117338.65) 

118727 (118788.57) 

117278(117319.88 ) 

27 

sc-shipsec5 

147140(147179.12) 

147656(147710.75) 

146991 (147022.95) 

149 

soc-BlogCatalog 

20752 (20752) 

20752 (20752.01) 

20752 (20752.01) 

0 

soc-brightkite 

21190 (21190) 

21190 (21190.09) 

21190 (21190.09) 

0 

soc-buzznet 

30625 (30625) 

30613 (30613) 

30613(30613) 

12 

soc-delicious 

85660 (85696.77) 

85343 (85364.83) 

85319(85333.75) 

341 

soc-digg 

103243 (103244.72) 

103234(103234.01) 

103234(103234.01) 

9 

soc-epinions 

9757 (9757) 

9757 (9757.02) 

9757 (9757.02) 

0 

soc-flickr 

153272(153272.03) 

153271 (153274.09) 

153271 (153271.45) 

1 

soc-flixster 

96317(96317) 

96317(96317.02) 

96317 (96317.02) 

0 

soc-FourSquare 

90108 (90109.09) 

90108 (90108.13) 

90108 (90108.13) 

0 

soc-gowalla 

84222 (84222.36) 

84222 (84224.28) 

84222 (84222.07) 

0 

soc-livejoumal 

1869044(1869054.64) 

1868997(1869010.13) 

1868924(1868932.92) 

120 

soc-pokec 

843419(843432.58) 

843768 (843783.01) 

843344 (843347.38) 

75 

soc-youtube 

146376(146376.13) 

146376(146376.35) 

146376(146376.1) 

0 

soc-youtube-snap 

276945 (276945) 

276945 (276945.21) 

276945 (276945.21) 

0 

tech-as-skitter 

527161 (527204.59) 

525132(525149.68) 

525086 (525099.14) 

2075 

tech-RL-caida 

74924 (74940.83 ) 

74618(74625.67) 

74607 (74615.25) 

317 

scc.infect-dubliii 

9104(9104) 

9110(9112.56) 

9103 (9103) 

1 

see _retweet-crawl 

8419(8419) 

8419(8419.02) 

8419 (8419.02) 

0 

web-arabic-2005 

114425 (114427.28) 

114431 (114435.40) 

114420(114420.67) 

5 

web-BerkStan 

5384 (5384) 

5388 (5388.13) 

5384 (5384.13) 

0 

web-it-2004 

414671 (414675.12) 

414854 (414874.98) 

414646 (414649) 

25 

web-spam 

2298 (2298.01) 

2297 (2298.07) 

2297 (2297.26) 

1 

web-wikipedia2009 

648315 (648321.83) 

648385 (648401.24) 

648300 (648312.39) 

15 


Experiment Setup 

All the solvers were compiled by g-n- 4.6.3 with the ’-03’ 
option. E or EastVCp] we adopt the parameter setting re¬ 
ported in ( |Cai 20 15) ^ The experiments were conducted on 
a cluster equipped with a number of Intel(R) Xeon(R) CPUs 
X5650 @2.67GHz with 8GB RAM, running Red Hat Santi¬ 
ago OS. 

All the algorithms are executed on each instance with a 
time limit of 1000 seconds, with seeds from 1 to 100. Eor 
each algorithm on each instance, we report the minimum 
size and averaged size {’’Cavg”) of vertex covers 

found by the algorithm. To make the comparisons clearer, 
we also report the difference (”A”) between the minimum 
size of vertex cover found by EastVC and that found by 
LinCom. A positive A means that LinCom finds a smaller 
vertex cover, while a negative A means that EastVC finds 
a smaller vertex cover. The numbers of vertices of these 
graphs lie between 1 x 10^ to 4 x 10®. We omit them and 
readers may refer to ( |Cai 2015) 1 or the download website. 

Experimental Results 

We show the main experimental results in Tables [T] and 
Eor the sake of space, we do not report the results on graphs 
with less than 1000 vertices. Eurthermore, we do not report 
the results on graphs where LinCom and EastVC precisely 
return both the same minimum size and average size. 

Erom the results in Tables [T] andwe observe that; 

1) LinCom attains the best known solutions for most in¬ 
stances, and makes a significant progress. In Eact, among all 
the 136 tested instances LinCom has found covers with 26 

"'http://lcs.ios.ac.cn/ caisw/Code/FastVC.zip 


less vertices on average. This improvement is big, since it 
rarely happens to find a better solution ( Cai 2015] ). 

2) LinCom is more robust. Actually out of 12 classes, 
LinCom outperforms EastVC over 7 classes, while EastVC 
outperforms LinCom over 1 class (e.g., facebook networks). 
It seems that our local search is too greedy and not as effec¬ 
tive as EastVC for facebook networks. 

3) There are quite a few instances (e.g., soc-delicious) 
where InitVC outperforms EastVC. This illustrates that our 
InitVC procedure generates desired starting vertex covers. 

Eurthermore, the solutions to the following 9 instances are 
guaranteed to be optimal: ca-CSphd, ca-Erdos992, ia-email- 
EU, ia-reality, ia-wiki-Talk, soc-douban, soc-LiveMocha, 
soc-twitter-follows, tech-internet-as. So our Jn/fUC proce¬ 
dure is sometimes complete in practice. 


Conclusions and Future Work 

In this paper, we have developed a local search algorithm for 
MinVC called LinCom, based on reduction rules and data 
structures. The reduction rules help generate a better qual¬ 
ity starting vertex cover, while the data structures lower the 
complexities of the heuristics. 

The main contributions are two folds: (1) we have lowered 
the complexity of two vertex cover construction heuristics 
and the best-picking heuristic based on the score-based alter¬ 
native partitions at the theoretical level; (2) we apply these 
results and some reduction rules to develop a local search 
solver which outperforms the state-of-the-art. 

As for future works we will utilize various diversifica¬ 
tion strategies to in our solver. Also, we will apply reduction 
rules to select vertices for exchanging in local search. 
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